Abstract: We consider the broadcast phase of a three-node
network, where a relay node establishes a bidirectional communication
between two nodes using a spectrally efficient two-phase
decode-and-forward protocol. In the first phase the two nodes
transmit their messages to the relay node. Then the relay node
decodes the messages and broadcasts a re-encoded composition of
them in the second phase. We consider Gaussian MIMO channels
and determine the capacity region for the second phase, which
we call the Gaussian MIMO bidirectional broadcast channel.

I. Introduction 

Future wireless communication systems should provide high 
data rates reliably in a certain area, even if the direct link does 
not have the desired quality due to path loss or shadowing. 
To face this challenge there has been growing interest in 
cooperative protocols where some nodes act as relay nodes to 
guarantee a closed coverage by multi-hop communication. In 
this work, we consider a three-node network, where one relay 
node establishes a bidirectional communication between the 
two other nodes. The problem of the two-way communication 
without a relay node was first studied in [1].

Since it is difficult to isolate simultaneously transmitted and 
received wireless signals within the same frequency band, we 
assume half-duplex nodes and therefore allocate orthogonal 
resources in time for orthogonal transmission and reception. 
Accordingly, the whole transmission is separated into two 
phases as depicted in Fig. 1. In the first phase of a decode-and-forward
protocol both nodes transmit their information to
the relay node and in the second phase the relay node decodes 
the messages and broadcasts a re-encoded composition of 
them. Since we do not allow any cooperation between nodes 1 
and 2, this can be seen as a restricted two-way relay channel. 

We assume multiple antennas at all nodes since they can 
increase the capacity of a system significantly [2]. The optimal 
coding strategy for the Gaussian multiple access (MAC) phase 
is well known [3], [4] and extends to the Gaussian MIMO 
case, see for instance [2]. We can assume that the relay node
can successfully decode the messages $w_1 \in \mathcal{W}_1$ and $w_2 \in \mathcal{W}_2$
from nodes 1 and 2 if we choose the corresponding rate
pair within the capacity region. In the following bidirectional
broadcast phase $w_1$ is known at the relay node and node 1 and
$w_2$ is known at the relay node and node 2. It remains for the
relay node to broadcast a message which allows both nodes
to recover the unknown message.

Fig. 1. Multiple access and broadcast phase of the bidirectional relay channel.

The bidirectional broadcast phase is analyzed for the discrete
memoryless channel with finite alphabets in [5]. An
achievable rate region of a compress-and-forward approach, 
where the relay node broadcasts a compressed version of the 
MAC output to both nodes, can be found in [6]. 

In this work, we extend the protocol of [5] to the Gaussian 
MIMO bidirectional broadcast channel. The capacity region 
for this case cannot be given in closed form because of its 
complicated structure. Therefore we use convex optimization 
methods to characterize the boundary of the capacity region.^1

II. Preliminaries 

Let $N_R$ be the number of transmit antennas at the relay
node and $N_k$ be the number of receive antennas at node $k$,
$k = 1, 2$. We define the discrete-time, memoryless Gaussian
MIMO channels between the relay node and nodes 1 and
2 respectively as linear time-invariant multiplicative channels
with additive white Gaussian noise. The vector-valued linear
input-output relations at one time instant can be expressed as

$$y_k = H_k x + n_k, \quad k = 1, 2,$$ (1)

where $y_k \in \mathbb{C}^{N_k \times 1}$ denotes the output, $H_k \in \mathbb{C}^{N_k \times N_R}$ the
channel matrix, $x \in \mathbb{C}^{N_R \times 1}$ the input, and $n_k \in \mathbb{C}^{N_k \times 1}$ the
circularly symmetric complex Gaussian noise of the
channel, distributed according to $\mathcal{CN}(0, \sigma^2 I_{N_k})$.
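The input-output relation (1) can be simulated directly. The following is a minimal sketch, assuming NumPy; the antenna numbers, the randomly drawn channel matrices, and the helper names `complex_gaussian` and `broadcast` are illustrative assumptions and do not appear in the paper.

```python
import numpy as np

def complex_gaussian(rng, shape, variance=1.0):
    """Draw i.i.d. circularly symmetric complex Gaussian entries CN(0, variance)."""
    scale = np.sqrt(variance / 2.0)
    return scale * (rng.standard_normal(shape) + 1j * rng.standard_normal(shape))

def broadcast(H1, H2, x, sigma2, rng):
    """Apply y_k = H_k x + n_k, k = 1, 2, to one relay input vector x."""
    n1 = complex_gaussian(rng, H1.shape[0], sigma2)
    n2 = complex_gaussian(rng, H2.shape[0], sigma2)
    return H1 @ x + n1, H2 @ x + n2

rng = np.random.default_rng(0)
N_R, N_1, N_2, sigma2 = 2, 2, 3, 1.0      # illustrative sizes, not taken from the paper
H1 = complex_gaussian(rng, (N_1, N_R))    # hypothetical channel realizations
H2 = complex_gaussian(rng, (N_2, N_R))
x = complex_gaussian(rng, N_R)            # one relay input vector
y1, y2 = broadcast(H1, H2, x, sigma2, rng)
```

Both receivers observe the same input `x` through their own channel matrix and independent noise, which is what makes the second phase a broadcast channel.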

We assume that the input alphabet is continuous so that
it is reasonable to consider an input constraint. A common
and physically meaningful constraint is an average power
constraint. This means any sequence $x_1, x_2, \dots, x_n$ of length
$n$ must satisfy

$$\frac{1}{n} \sum_{i=1}^{n} \|x_i\|^2 \le P.$$ (2)

^1 Notation: Matrices and random variables are denoted by bold capital
letters, vectors by bold lower case letters, and sets by calligraphic letters;
$\mathbb{R}_+$ denotes the set of non-negative real numbers and $M(N, \mathbb{C})$ the space of
$N \times N$ matrices with complex entries; $(\cdot)^{-1}$ and $(\cdot)^H$ denote inverse and
Hermitian transpose; $\mathbb{E}[\cdot]$ is the expectation; $Q \succeq 0$ means $Q$ is positive
semidefinite; $\liminf$ and $\limsup$ denote the limit inferior and limit superior.
Definition 1: The Gaussian MIMO bidirectional broadcast
channel with average power limitation consists of two channels
between the relay node and nodes 1 and 2 as defined
in (1) with $x \in \mathcal{X} \subset \mathbb{C}^{N_R \times 1}$, where $\mathcal{X}$ describes the set
of possible input sequences which satisfy the average power
constraint (2), i.e., for a sequence of length $n$ we have
$\mathcal{X}^n := \{(x_1, x_2, \dots, x_n) \in \mathbb{C}^{N_R \times n} : \frac{1}{n} \sum_{i=1}^{n} \|x_i\|^2 \le P\}$.

Let $W_1$ and $W_2$ be the independent information sources at
nodes 1 and 2, which are also known at the relay node. We
assume that $W_k$ is uniformly distributed on the message set
$\mathcal{W}_k := \{1, 2, \dots, M_k^{(n)}\}$ with $n$ the length of the block code.
Further, we use the abbreviation $\mathcal{V} := \mathcal{W}_1 \times \mathcal{W}_2$.

Definition 2: A $(M_1^{(n)}, M_2^{(n)}, n)$-code for the Gaussian
MIMO bidirectional broadcast channel with average power
limitation consists of one encoder at the relay node

$$f : \mathcal{V} \to \mathcal{X}^n$$

and decoders at nodes 1 and 2

$$g_1 : \mathbb{C}^{N_1 \times n} \times \mathcal{W}_1 \to \mathcal{W}_2 \cup \{0\},$$
$$g_2 : \mathbb{C}^{N_2 \times n} \times \mathcal{W}_2 \to \mathcal{W}_1 \cup \{0\}.$$

The element 0 in the decoder plays the role of an erasure
symbol and is included in the definition for convenience only.

When the relay node sends the message $v = [w_1, w_2]$,
the receiver of node 1 is in error if $g_1(y_1^n, w_1) \ne w_2$.
We denote the probability of this event by
$\lambda_1(v) := \mathbb{P}[g_1(y_1^n, w_1) \ne w_2 \mid f(v) \text{ has been sent}]$. Accordingly, we
denote the probability that the receiver of node 2 is in error by
$\lambda_2(v) := \mathbb{P}[g_2(y_2^n, w_2) \ne w_1 \mid f(v) \text{ has been sent}]$. Now, we
are able to introduce the notation for the average probability
of error for the $k$-th node

$$\mu_k^{(n)} := \frac{1}{|\mathcal{V}|} \sum_{v \in \mathcal{V}} \lambda_k(v).$$

Definition 3: A rate pair $[R_1, R_2]$ is said to be achievable
for the Gaussian MIMO bidirectional broadcast channel with
average power limitation if for any $\delta > 0$ there is an $n(\delta) \in \mathbb{N}$
and a sequence of $(M_1^{(n)}, M_2^{(n)}, n)$-codes satisfying the power
constraint such that for all $n \ge n(\delta)$ we have
$\frac{\log M_1^{(n)}}{n} \ge R_2 - \delta$ and $\frac{\log M_2^{(n)}}{n} \ge R_1 - \delta$
while $\mu_1^{(n)}, \mu_2^{(n)} \to 0$ as $n \to \infty$. The
capacity region is the set of all achievable rate pairs, which is
defined as $\mathcal{C}_{\mathrm{BDBC}} := \{[R_1, R_2] \in \mathbb{R}_+^2 : [R_1, R_2] \text{ achievable}\}$.

III. Capacity Region 

In this section we present and prove our main result which 
is the capacity region of the Gaussian MIMO bidirectional 
broadcast channel. 



Theorem 1: For given covariance matrix $Q$ with $\mathrm{tr}(Q) \le P$
satisfying the power constraint the corresponding rate pair
$[C_1(Q), C_2(Q)]$ is given by

$$C_k(Q) := \log\det\left(I_{N_k} + \frac{1}{\sigma^2} H_k Q H_k^H\right), \quad k = 1, 2.$$

Then the capacity region $\mathcal{C}_{\mathrm{BDBC}}$ of the Gaussian MIMO
bidirectional broadcast channel is given by

$$\mathcal{C}_{\mathrm{BDBC}} := \bigcup_{Q:\ \mathrm{tr}(Q) \le P,\ Q \succeq 0} \mathrm{dpch}\left([C_1(Q), C_2(Q)]\right)$$

where $\mathrm{dpch}(\cdot)$ denotes the downward positive comprehensive
hull, which is defined for the vector $x \in \mathbb{R}_+^2$ by the set
$\mathrm{dpch}(x) := \{y \in \mathbb{R}_+^2 : y_i \le x_i,\ i = 1, 2\}$.
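To make the theorem concrete, the rate pair $[C_1(Q), C_2(Q)]$ and membership in its downward positive comprehensive hull can be evaluated numerically. The sketch below assumes NumPy and logarithms to base 2 (rates in bits per channel use); the channel matrices, power budget, and function names `rate` and `in_dpch` are hypothetical.

```python
import numpy as np

def rate(H, Q, sigma2):
    """C_k(Q) = log2 det(I + H Q H^H / sigma^2), in bits per channel use."""
    M = np.eye(H.shape[0]) + (H @ Q @ H.conj().T) / sigma2
    _, logdet = np.linalg.slogdet(M)      # numerically safer than log(det(M))
    return logdet / np.log(2.0)

def in_dpch(r, c):
    """True if the rate pair r lies in dpch(c), i.e. 0 <= r_i <= c_i for i = 1, 2."""
    return all(0.0 <= ri <= ci for ri, ci in zip(r, c))

# Hypothetical 2x2 channels and a power budget, chosen only for illustration.
H1 = np.array([[1.0, 0.5], [0.2, 1.0]], dtype=complex)
H2 = np.array([[0.8, -0.3], [0.4, 0.9]], dtype=complex)
P, sigma2 = 10.0, 1.0

Q = (P / 2) * np.eye(2)                   # equal power allocation, tr(Q) = P
c = (rate(H1, Q, sigma2), rate(H2, Q, sigma2))
```

A single covariance matrix `Q` contributes one rectangle `dpch(c)`; the capacity region is the union of these rectangles over all feasible `Q`.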

A. Proof of Achievability 

We follow [5] and adapt the random coding proof for the 
degraded broadcast channel of [7] to our context. 

For a given covariance matrix $Q$ with $\mathrm{tr}(Q) \le P$ satisfying
the power constraint we have to show that all rate pairs
$[R_1, R_2]$ are achievable which satisfy

$$R_k \le \log\det\left(I_{N_k} + \frac{1}{\sigma^2} H_k Q H_k^H\right), \quad k = 1, 2.$$

We denote the achievable rate region as $\mathcal{R}_{\mathrm{BDBC}}$.

1) Random codebook generation and encoding: For any
$\delta > 0$ we have to ensure that the probability that a codeword
does not satisfy the power constraint goes to zero as the block
length $n$ goes to infinity. Therefore, we define the covariance
matrix $\hat{Q} := \frac{\hat{P}}{P} Q$ with $\hat{P} := P - \epsilon_P$, $\epsilon_P \in (0, P]$, where
$\epsilon_P$ allows us to get the rate $\hat{R}_k$ corresponding to the transmit
strategy $\hat{Q}$ arbitrarily close to $R_k$. Then we define for any $\hat{R}_k$
the rate of the code $R_k^* := \max\{\frac{1}{n} \lfloor n(\hat{R}_k - \frac{\delta}{2}) \rfloor, 0\}$, $k = 1, 2$.^2
We generate $M_1^{(n)} M_2^{(n)}$ codewords of length $n$ with
$M_1^{(n)} := 2^{n R_2^*}$ and $M_2^{(n)} := 2^{n R_1^*}$, where for each
$v = [w_1, w_2] \in \mathcal{V}$ each entry of the corresponding codeword
$f(v) = x^n(v)$ is independently chosen according to
$\mathcal{CN}(0, \hat{Q})$. We first bound the error probability with respect
to the codebook which might violate the power constraint.
Therefore let $\hat{\lambda}_k(v)$ and $\hat{\mu}_k^{(n)}$, $k = 1, 2$, be the corresponding
error probabilities.

In the following the random variable $X$ denotes an entry
of the codeword $X^n$ and the random variable $Y_k$ an entry of
the output $Y_k^n$, $k = 1, 2$.

2) Decoding: The receiving nodes use typical set decoding.
Let $I(X; Y_k) := \mathbb{E}_{X^n, Y_k^n}[i(X^n; Y_k^n)]$, $k = 1, 2$, denote the
average mutual information with $i(x^n; y_k^n) := \frac{1}{n} \log \frac{p(y_k^n | x^n)}{p(y_k^n)}$
for realizations $x^n, y_k^n$ of the random variables $X^n, Y_k^n$. At
each receiving node $k$ we have the decoding sets

$$S(y_k^n) := \left\{ x^n \in \mathcal{X}^n : i(x^n; y_k^n) > \frac{R_k^* + I(X; Y_k)}{2} \right\}$$

and the indicator function

$$d(x^n, y_k^n) := \begin{cases} 1, & \text{if } x^n \notin S(y_k^n) \\ 0, & \text{else.} \end{cases}$$
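The decoding rule relies on the information density $i(x^n; y_k^n)$ concentrating around $I(X; Y_k)$ for long blocks. The following sketch illustrates this for one receiver, assuming NumPy, base-2 logarithms, a positive definite input covariance, and hypothetical values for the channel and covariance; the function names are illustrative only.

```python
import numpy as np

def mutual_information(H, Q, sigma2):
    """I(X; Y) = log2 det(I + H Q H^H / sigma^2) for X ~ CN(0, Q)."""
    M = np.eye(H.shape[0]) + (H @ Q @ H.conj().T) / sigma2
    return np.linalg.slogdet(M)[1] / np.log(2.0)

def information_density(H, Q, sigma2, n, rng):
    """Sample i(x^n; y^n) = (1/n) log2 [p(y^n|x^n) / p(y^n)] for i.i.d. CN(0, Q) inputs."""
    N_rx, N_tx = H.shape
    L = np.linalg.cholesky(Q)             # Q = L L^H, requires Q positive definite
    g = (rng.standard_normal((N_tx, n)) + 1j * rng.standard_normal((N_tx, n))) / np.sqrt(2)
    X = L @ g                             # columns are i.i.d. CN(0, Q)
    noise = np.sqrt(sigma2 / 2) * (rng.standard_normal((N_rx, n))
                                   + 1j * rng.standard_normal((N_rx, n)))
    Y = H @ X + noise
    K = H @ Q @ H.conj().T + sigma2 * np.eye(N_rx)      # covariance of each output column
    quad = np.real(np.sum(Y.conj() * np.linalg.solve(K, Y), axis=0))   # y^H K^{-1} y
    err = np.sum(np.abs(Y - H @ X) ** 2, axis=0) / sigma2              # ||y - Hx||^2 / sigma^2
    # ln p(y|x) - ln p(y) per symbol; the pi terms of the two densities cancel
    per_symbol = np.linalg.slogdet(K)[1] - N_rx * np.log(sigma2) + quad - err
    return per_symbol.mean() / np.log(2.0)

rng = np.random.default_rng(0)
H = np.array([[1.0, 0.5], [0.2, 1.0]], dtype=complex)   # hypothetical channel
Q = np.diag([3.0, 1.0]).astype(complex)                 # hypothetical input covariance
I_XY = mutual_information(H, Q, 1.0)
i_n = information_density(H, Q, 1.0, n=50000, rng=rng)
```

For large `n` the sample `i_n` lies close to `I_XY`, so a threshold placed strictly between $R_k^*$ and $I(X; Y_k)$ is exceeded by the transmitted codeword with high probability.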



^2 If $R_k^* = 0$, the error probability is zero by definition so that we always
assume $R_k^* > 0$ in the following.



When $x^n(v)$ has been sent, and $y_1^n$ and $y_2^n$ have been received
at nodes 1 and 2, two different events of error may occur at
the decoder: the codeword $x^n(v)$ is not in $S(y_k^n)$ (occurring with
probability $P_{e,k}^{(1)}(v)$) and at node one $x^n(w_1, \hat{w}_2)$ with $\hat{w}_2 \ne w_2$
is in $S(y_1^n)$ or at node two $x^n(\hat{w}_1, w_2)$ with $\hat{w}_1 \ne w_1$ is in
$S(y_2^n)$ (occurring with probability $P_{e,k}^{(2)}(v)$). If there is no or
more than one codeword $x^n(w_1, \cdot) \in S(y_1^n)$ or $x^n(\cdot, w_2) \in S(y_2^n)$,
the decoders map on the erasure symbol 0.

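3) Analysis of the probability of decoding error: From the
union bound we have $\hat{\lambda}_k(v) \le P_{e,k}^{(1)}(v) + P_{e,k}^{(2)}(v)$ with

$$P_{e,k}^{(1)}(v) = \int_{\mathbb{C}^{N_k \times n}} p(y_k^n | x^n(v))\, d(x^n(v), y_k^n)\, dy_k^n \quad \text{for } k = 1, 2,$$

$$P_{e,1}^{(2)}(v) = \int_{\mathbb{C}^{N_1 \times n}} p(y_1^n | x^n(v)) \sum_{\substack{\hat{w}_2 = 1 \\ \hat{w}_2 \ne w_2}}^{|\mathcal{W}_2|} \left(1 - d(x^n(w_1, \hat{w}_2), y_1^n)\right) dy_1^n.$$

The error event $P_{e,2}^{(2)}(v)$ is defined similarly. For uniformly
distributed messages $W_1$ and $W_2$ we define
$P_{e,k}^{(j)} := \frac{1}{|\mathcal{V}|} \sum_{v \in \mathcal{V}} P_{e,k}^{(j)}(v)$, $j = 1, 2$.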

For applying the weak law of large numbers we have to 
ensure that the first two moments are finite [8, Section 7.3]. 

Lemma 1: The mean and variance of $i(X^n; Y_k^n)$, $k = 1, 2$,
are finite.
Proof: The proof is a generalization of [9,
Theorem 8.2.2] to the vector-valued case and is omitted here
for brevity. ■
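Next, we average over all codebooks and show that
$\mathbb{E}_{X^n}[P_{e,k}^{(1)}], \mathbb{E}_{X^n}[P_{e,k}^{(2)}] \to 0$ as $n \to \infty$ if $R_k^* < I(X; Y_k)$,
$k = 1, 2$. Recall that $R_k^* \le \hat{R}_k - \frac{\delta}{2}$ holds so that we have

$$\begin{aligned}
\frac{1}{|\mathcal{V}|} \sum_{v \in \mathcal{V}} \mathbb{E}_{X^n}[P_{e,k}^{(1)}(v)]
&= \mathbb{E}_{X^n}\left[ \int_{\mathbb{C}^{N_k \times n}} p(y_k^n | X^n)\, d(X^n, y_k^n)\, dy_k^n \right] \\
&= \int \int p(x^n)\, p(y_k^n | x^n)\, d(x^n, y_k^n)\, dy_k^n\, dx^n \\
&= \mathbb{E}_{X^n, Y_k^n}[d(X^n, Y_k^n)] = \mathbb{P}[d(X^n, Y_k^n) = 1] \\
&= \mathbb{P}\left[ i(X^n; Y_k^n) \le \frac{R_k^* + I(X; Y_k)}{2} \right] \\
&\le \mathbb{P}\left[ i(X^n; Y_k^n) \le I(X; Y_k) - \frac{\delta}{4} \right] \to 0 \quad \text{as } n \to \infty
\end{aligned}$$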







by the law of large numbers since Lemma 1 holds. The fourth 
equality follows from Fubini's theorem. For the calculation 
of $\mathbb{E}_{X^n}[P_{e,k}^{(2)}]$ we have to distinguish between the nodes. We
present the analysis for $k = 1$, the case $k = 2$ follows
similarly. We use the fact that for $v = [w_1, w_2] \ne [w_1, \hat{w}_2]$ the
random variables in $p(y_1^n | X^n(v))$ and $d(X^n(w_1, \hat{w}_2), y_1^n)$ are
independent for each choice of $y_1^n \in \mathbb{C}^{N_1 \times n}$.



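$$\begin{aligned}
\mathbb{E}_{X^n}[P_{e,1}^{(2)}(v)]
&= \int_{\mathbb{C}^{N_1 \times n}} \sum_{\substack{\hat{w}_2 = 1 \\ \hat{w}_2 \ne w_2}}^{|\mathcal{W}_2|} \mathbb{E}_{X^n}[p(y_1^n | X^n(v))]\, \mathbb{E}_{X^n}[1 - d(X^n(w_1, \hat{w}_2), y_1^n)]\, dy_1^n \\
&= \int_{\mathbb{C}^{N_1 \times n}} \sum_{\substack{\hat{w}_2 = 1 \\ \hat{w}_2 \ne w_2}}^{|\mathcal{W}_2|} p(y_1^n)\, \mathbb{E}_{X^n}[1 - d(X^n(w_1, \hat{w}_2), y_1^n)]\, dy_1^n \\
&= \sum_{\substack{\hat{w}_2 = 1 \\ \hat{w}_2 \ne w_2}}^{|\mathcal{W}_2|} \int p(y_1^n) \int p(x^n) \left(1 - d(x^n, y_1^n)\right) dx^n\, dy_1^n \\
&= (|\mathcal{W}_2| - 1) \int \int_{x^n \in S(y_1^n)} p(x^n)\, p(y_1^n)\, dx^n\, dy_1^n,
\end{aligned}$$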

where in the third equality the change of the order of 
integration follows again from Fubini's theorem. Whenever 

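$x^n \in S(y_1^n)$, we have $i(x^n; y_1^n) = \frac{1}{n} \log \frac{p(y_1^n | x^n)}{p(y_1^n)} > \frac{1}{2}(R_1^* + I(X; Y_1))$
or $p(y_1^n) < p(y_1^n | x^n)\, 2^{-\frac{n}{2}(R_1^* + I(X; Y_1))}$.
Consequently,

$$\begin{aligned}
\mathbb{E}_{X^n}[P_{e,1}^{(2)}] &\le |\mathcal{W}_2| \int \int p(x^n)\, p(y_1^n | x^n)\, 2^{-\frac{n}{2}(R_1^* + I(X; Y_1))}\, dx^n\, dy_1^n \\
&\le 2^{n R_1^*}\, 2^{-\frac{n}{2}(R_1^* + I(X; Y_1))} = 2^{\frac{n}{2}(R_1^* - I(X; Y_1))} \to 0 \quad \text{as } n \to \infty
\end{aligned}$$

if $R_1^* < I(X; Y_1)$. The case $k = 2$ follows immediately so
that if $R_k^* < I(X; Y_k)$, $k = 1, 2$, the average probability of
error gets arbitrarily small for sufficiently large block length $n$.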

4) Codebook that satisfies the power constraint: Up to now 
some codewords f(v) may violate the power constraint. The 
probability of this event is given by 

$$P_e^{(0)}(v) := \mathbb{P}\left[ \tfrac{1}{n} \|X^n(v)\|^2 > P \right].$$

Next, for each randomly generated codebook we construct
a new codebook in which every codeword $f(v)$ that does not
satisfy the power constraint is replaced by the zero sequence,
which obviously satisfies the power constraint. We
upper bound the probability of a decoding error of the zero
sequence by 1. Then it easily follows that

$$\lambda_k(v) \le P_e^{(0)}(v) + P_{e,k}^{(1)}(v) + P_{e,k}^{(2)}(v), \quad k = 1, 2.$$

Since $W_1$ and $W_2$ are uniformly distributed, we have
$P_e^{(0)} := \frac{1}{|\mathcal{V}|} \sum_{v \in \mathcal{V}} P_e^{(0)}(v)$
so that $\mu_k^{(n)} \le P_e^{(0)} + P_{e,k}^{(1)} + P_{e,k}^{(2)}$, $k = 1, 2$. Averaging over all
codebook realizations, we get

$$\mathbb{E}[\mu_k^{(n)}] \le \mathbb{E}[P_e^{(0)}] + \mathbb{E}[P_{e,k}^{(1)}] + \mathbb{E}[P_{e,k}^{(2)}].$$

The first term describes the probability of violating the
power constraint. Thereby $\frac{1}{n} \|X^n\|^2$ is the arithmetic average
of $n$ independent, identically distributed random variables
with $\mathbb{E}[\|X\|^2] = \hat{P}$. By the weak law of large numbers,
the arithmetic average converges in probability to $\hat{P}$. Since
$\hat{P} < P$, we have $\mathbb{E}[P_e^{(0)}] \to 0$ as $n \to \infty$. Since
$\mathbb{E}_{X^n}[P_{e,k}^{(1)}], \mathbb{E}_{X^n}[P_{e,k}^{(2)}] \to 0$ as $n \to \infty$ as well, we have
$\mu_k^{(n)} \to 0$ as $n \to \infty$, $k = 1, 2$.
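The effect of the power back-off can be checked with a small Monte Carlo experiment. The sketch below assumes NumPy and, purely for illustration, an isotropic covariance $\hat{Q} = \frac{\hat{P}}{N_R} I$ with hypothetical values for $P$, $\epsilon_P$, and $N_R$; it estimates how often a randomly drawn codeword violates the average power constraint.

```python
import numpy as np

def violation_rate(P, eps_P, N_R, n, trials, rng):
    """Estimate P[(1/n) ||X^n||^2 > P] for i.i.d. columns X_i ~ CN(0, (P - eps_P)/N_R * I)."""
    var = (P - eps_P) / N_R               # per-antenna variance, so E||X_i||^2 = P - eps_P
    shape = (trials, N_R, n)
    X = np.sqrt(var / 2) * (rng.standard_normal(shape) + 1j * rng.standard_normal(shape))
    avg_power = np.sum(np.abs(X) ** 2, axis=(1, 2)) / n   # (1/n) ||X^n||^2 per codeword
    return np.mean(avg_power > P)

rng = np.random.default_rng(1)
# Hypothetical parameters: power budget 10, back-off 1, two relay antennas.
rates = [violation_rate(P=10.0, eps_P=1.0, N_R=2, n=n, trials=500, rng=rng)
         for n in (10, 100, 1000)]
```

The estimated violation probability shrinks as the block length grows, in line with the weak law of large numbers argument: the empirical power concentrates around $\hat{P}$, which lies strictly below $P$.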



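5) Achievable rates: Since $Y_{ki} = H_k X_i + N_{ki}$,
$k = 1, 2$, and $X_i \sim \mathcal{CN}(0, \hat{Q})$ with $\mathrm{tr}(\hat{Q}) = \hat{P}$ and
$N_{ki} \sim \mathcal{CN}(0, \sigma^2 I_{N_k})$ are independent and multivariate
normal distributed, it follows that
$I(X; Y_k) = \log\det(I_{N_k} + \frac{1}{\sigma^2} H_k \hat{Q} H_k^H)$ with $\mathrm{tr}(\hat{Q}) = \hat{P}$.
There exists an $\epsilon_P > 0$ such that

$$\hat{R}_k = \log\det\left(I_{N_k} + \frac{1}{\sigma^2} H_k \hat{Q} H_k^H\right) > \log\det\left(I_{N_k} + \frac{1}{\sigma^2} H_k Q H_k^H\right) - \frac{\delta}{2} = R_k - \frac{\delta}{2}.$$

Finally, we have

$$R_k^* > \log\det\left(I_{N_k} + \frac{1}{\sigma^2} H_k Q H_k^H\right) - \delta = R_k - \delta$$

while $\mu_k^{(n)} \to 0$ as $n \to \infty$, $k = 1, 2$, which proves the
achievability. ■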

B. Proof of Weak Converse

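We have to show that for any given sequence of
$(M_1^{(n)}, M_2^{(n)}, n)$-codes with $\mu_1^{(n)}, \mu_2^{(n)} \to 0$ there exists
a covariance matrix $Q$ with $\mathrm{tr}(Q) \le P$ satisfying the power
constraint such that

$$R_1 := \liminf_{n \to \infty} \frac{1}{n} \log M_2^{(n)} \le \log\det\left(I_{N_1} + \frac{1}{\sigma^2} H_1 Q H_1^H\right) + o(n^0)$$

$$R_2 := \liminf_{n \to \infty} \frac{1}{n} \log M_1^{(n)} \le \log\det\left(I_{N_2} + \frac{1}{\sigma^2} H_2 Q H_2^H\right) + o(n^0)$$

are satisfied. That means we have $\mathcal{C}_{\mathrm{BDBC}} \subseteq \mathcal{R}_{\mathrm{BDBC}}$.

Lemma 2: For our context we have Fano's inequality

$$H(W_2 | Y_1^n, W_1) \le \mu_1^{(n)} \log M_2^{(n)} + 1 = n \epsilon_1^{(n)}$$

with $\epsilon_1^{(n)} = \frac{\log M_2^{(n)}}{n} \mu_1^{(n)} + \frac{1}{n} \to 0$ for $n \to \infty$ as $\mu_1^{(n)} \to 0$.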
Proof: From $Y_1^n$ and $W_1$ node 1 estimates the index $W_2$ from
the sent codeword $X^n(W_1, W_2)$. We define the event of an
error at node 1 as

$$E_1 := \begin{cases} 1, & \text{if } g_1(Y_1^n, W_1) \ne W_2 \\ 0, & \text{if } g_1(Y_1^n, W_1) = W_2 \end{cases}$$

so that we have for the average probability of error
$\mu_1^{(n)} = \mathbb{P}[E_1 = 1]$. From the chain rule for entropies [9, Lemma
8.3.2] we have

$$\begin{aligned}
H(E_1, W_2 | Y_1^n, W_1) &= H(W_2 | Y_1^n, W_1) + H(E_1 | Y_1^n, W_1, W_2) \\
&= H(E_1 | Y_1^n, W_1) + H(W_2 | E_1, Y_1^n, W_1).
\end{aligned}$$

Since $E_1$ is a function of $W_1$, $W_2$, and $Y_1^n$, we have
$H(E_1 | Y_1^n, W_1, W_2) = 0$. Further, since $E_1$ is a binary-valued
random variable, we get $H(E_1 | Y_1^n, W_1) \le H(E_1) \le 1$. So
that finally with the next inequality

$$\begin{aligned}
H(W_2 | Y_1^n, W_1, E_1) &= \mathbb{P}[E_1 = 0]\, H(W_2 | Y_1^n, W_1, E_1 = 0) + \mathbb{P}[E_1 = 1]\, H(W_2 | Y_1^n, W_1, E_1 = 1) \\
&\le (1 - \mu_1^{(n)}) \cdot 0 + \mu_1^{(n)} \log(M_2^{(n)} - 1) \le \mu_1^{(n)} \log M_2^{(n)}
\end{aligned}$$

we get Fano's inequality for our context. ■
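With a similar derivation we get $H(W_1 | Y_2^n, W_2) \le \mu_2^{(n)} \log M_1^{(n)} + 1 = n \epsilon_2^{(n)}$
with $\epsilon_2^{(n)} = \frac{\log M_1^{(n)}}{n} \mu_2^{(n)} + \frac{1}{n} \to 0$
for $n \to \infty$ as $\mu_2^{(n)} \to 0$.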
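As a sanity check of the form of Fano's inequality used here, consider a toy example that is not part of the paper: a message $W$ uniform on $M$ values is observed through an $M$-ary symmetric channel with error probability $p$ and decoded as the observation itself, so the error probability is $\mu = p$. The side information $W_1$ is omitted in this simplified setting, and all names below are hypothetical.

```python
import math

def binary_entropy(p):
    """Binary entropy in bits, with the convention 0 * log 0 = 0."""
    if p in (0.0, 1.0):
        return 0.0
    return -p * math.log2(p) - (1 - p) * math.log2(1 - p)

def conditional_entropy_symmetric(M, p):
    """H(W|Y) in bits for a uniform W over an M-ary symmetric channel with error probability p."""
    return binary_entropy(p) + p * math.log2(M - 1)

def fano_bound(M, mu):
    """Right-hand side mu * log2(M) + 1 of Fano's inequality as stated in Lemma 2."""
    return mu * math.log2(M) + 1.0

# Compare the exact conditional entropy with the bound on a small grid.
checks = [(M, p, conditional_entropy_symmetric(M, p), fano_bound(M, p))
          for M in (2, 16, 1024) for p in (0.0, 0.01, 0.1, 0.5, 0.9)]
```

In every entry of `checks` the exact conditional entropy stays below the bound, and both shrink relative to $\log M$ as the error probability tends to zero, which is the property the converse exploits.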



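Lemma 3: The rate $\frac{1}{n} H(W_2)$ can be bounded as follows

$$\frac{1}{n} H(W_2) \le \log\det\left(I_{N_1} + \frac{1}{\sigma^2} H_1 \Big(\frac{1}{n} \sum_{i=1}^{n} Q_i\Big) H_1^H\right) + \epsilon_1^{(n)}$$

with $\epsilon_1^{(n)} = \frac{\log M_2^{(n)}}{n} \mu_1^{(n)} + \frac{1}{n} \to 0$ for $n \to \infty$ as $\mu_1^{(n)} \to 0$.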
Proof: First, we bound the entropy $H(W_2)$ as follows

$$\begin{aligned}
H(W_2) = H(W_2 | W_1) &= I(W_2; Y_1^n | W_1) + H(W_2 | Y_1^n, W_1) \\
&\le I(W_2; Y_1^n | W_1) + n \epsilon_1^{(n)} \le I(W_1, W_2; Y_1^n) + n \epsilon_1^{(n)} \\
&\le I(X^n; Y_1^n) + n \epsilon_1^{(n)}
\end{aligned}$$

where the equalities and inequalities follow from the independence
of $W_1$ and $W_2$, the definition of mutual information,
Lemma 2, and the chain rule for mutual information. Since
$(W_1, W_2)$, $X^n$, $Y_1^n$ form a Markov chain, it can be shown
that the last inequality holds. If we use the definition of mutual
information and the memoryless property of the channel, we
can express the inequality as

$$\begin{aligned}
H(W_2) &\le \left(h(Y_1^n) - h(Y_1^n | X^n)\right) + n \epsilon_1^{(n)} \\
&\le \sum_{i=1}^{n} \left(h(Y_{1i}) - h(Y_{1i} | X_i)\right) + n \epsilon_1^{(n)} = \sum_{i=1}^{n} I(Y_{1i}; X_i) + n \epsilon_1^{(n)}.
\end{aligned}$$



If we divide the inequality by n and use again the definition 
of mutual information we get 



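$$\frac{1}{n} H(W_2) \le \frac{1}{n} \sum_{i=1}^{n} \left(h(Y_{1i}) - h(N_{1i})\right) + \epsilon_1^{(n)}$$

with $Y_{1i} = H_1 X_i + N_{1i}$. The random variables $X_i$ and
$N_{1i}$ with $h(N_{1i}) = \log\det(\pi e\, \sigma^2 I_{N_1})$ are independent. Let
$Q_i$ denote the covariance matrix of $X_i$. It
follows from the entropy maximization theorem that
$h(Y_{1i}) \le \log\det(\pi e (\sigma^2 I_{N_1} + H_1 Q_i H_1^H))$ with equality if we have
Gaussian input, i.e., $X_i \sim \mathcal{CN}(0, Q_i)$. Therewith we have
$h(Y_{1i}) - h(N_{1i}) \le \log\det(I_{N_1} + \frac{1}{\sigma^2} H_1 Q_i H_1^H)$. Finally,
we get

$$\begin{aligned}
\frac{1}{n} H(W_2) &\le \frac{1}{n} \sum_{i=1}^{n} \log\det\left(I_{N_1} + \frac{1}{\sigma^2} H_1 Q_i H_1^H\right) + \epsilon_1^{(n)} \\
&\le \log\det\left(I_{N_1} + \frac{1}{\sigma^2} H_1 \Big(\frac{1}{n} \sum_{i=1}^{n} Q_i\Big) H_1^H\right) + \epsilon_1^{(n)}
\end{aligned}$$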



where the second inequality follows from the concavity of the 
log det function [10, Theorem 7.6.7]. ■ 
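The concavity step can be illustrated numerically: the average of the per-symbol log det terms never exceeds the log det evaluated at the averaged covariance. This is a minimal sketch assuming NumPy and base-2 logarithms; the channel and the randomly generated covariance matrices are hypothetical stand-ins for $H_1$ and the $Q_i$.

```python
import numpy as np

def logdet_rate(H, Q, sigma2):
    """log2 det(I + H Q H^H / sigma^2)."""
    M = np.eye(H.shape[0]) + (H @ Q @ H.conj().T) / sigma2
    return np.linalg.slogdet(M)[1] / np.log(2.0)

def random_psd(rng, N, power):
    """Random positive semidefinite N x N matrix scaled to trace `power`."""
    A = rng.standard_normal((N, N)) + 1j * rng.standard_normal((N, N))
    Q = A @ A.conj().T
    return power * Q / np.trace(Q).real

rng = np.random.default_rng(2)
H1 = rng.standard_normal((2, 2)) + 1j * rng.standard_normal((2, 2))   # hypothetical channel
Qs = [random_psd(rng, 2, power=rng.uniform(1.0, 10.0)) for _ in range(50)]

lhs = np.mean([logdet_rate(H1, Q, 1.0) for Q in Qs])   # (1/n) sum_i log det(I + H1 Q_i H1^H)
rhs = logdet_rate(H1, sum(Qs) / len(Qs), 1.0)          # log det at the averaged covariance
```

Here `lhs <= rhs` holds for any choice of positive semidefinite `Qs`, which is Jensen's inequality for the concave map $Q \mapsto \log\det(I + \frac{1}{\sigma^2} H_1 Q H_1^H)$.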
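With a similar derivation we get
$\frac{1}{n} H(W_1) \le \log\det\left(I_{N_2} + \frac{1}{\sigma^2} H_2 \big(\frac{1}{n} \sum_{i=1}^{n} Q_i\big) H_2^H\right) + \epsilon_2^{(n)}$
with $\epsilon_2^{(n)} = \frac{\log M_1^{(n)}}{n} \mu_2^{(n)} + \frac{1}{n} \to 0$ for $n \to \infty$ as $\mu_2^{(n)} \to 0$.

It is clear that $R_1 = \liminf_{n \to \infty} \frac{1}{n} \log M_2^{(n)} \le \limsup_{n \to \infty} \frac{1}{n} \log M_2^{(n)}$
holds. Since $W_2$ is uniformly distributed,
we have $\frac{1}{n} \log M_2^{(n)} = \frac{1}{n} H(W_2)$ and obtain with
Lemma 3

$$R_1 \le \limsup_{n \to \infty} \left[ \log\det\left(I_{N_1} + \frac{1}{\sigma^2} H_1 \Big(\frac{1}{n} \sum_{i=1}^{n} Q_i\Big) H_1^H\right) + \epsilon_1^{(n)} \right].$$ (3)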



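Next, we define the compact set $\mathcal{Q} := \{Q \in M(N_R, \mathbb{C}) : \mathrm{tr}(Q) \le P,\ Q \succeq 0\}$
with $\frac{1}{n} \sum_{i=1}^{n} Q_i \in \mathcal{Q}$ since $\frac{1}{n} \sum_{i=1}^{n} Q_i \succeq 0$
and $\frac{1}{n} \sum_{i=1}^{n} \mathrm{tr}(Q_i) = \mathrm{tr}\big(\frac{1}{n} \sum_{i=1}^{n} Q_i\big) \le P$ hold. This
implies that there exists a subsequence $(n_l)_{l \in \mathbb{N}}$ such that
$\frac{1}{n_l} \sum_{i=1}^{n_l} Q_i \to Q$ as $n_l \to \infty$ with $Q \in \mathcal{Q}$. Therewith and
with the continuity of the log det we have

$$\limsup_{n_l \to \infty} \left[ \log\det\left(I_{N_1} + \frac{1}{\sigma^2} H_1 \Big(\frac{1}{n_l} \sum_{i=1}^{n_l} Q_i\Big) H_1^H\right) + \epsilon_1^{(n_l)} \right] = \log\det\left(I_{N_1} + \frac{1}{\sigma^2} H_1 Q H_1^H\right).$$ (4)

Combining (3) and (4) we obtain $R_1 \le \log\det(I_{N_1} + \frac{1}{\sigma^2} H_1 Q H_1^H)$.
Using the same subsequence $(n_l)_{l \in \mathbb{N}}$ and
arguments we get $R_2 \le \log\det(I_{N_2} + \frac{1}{\sigma^2} H_2 Q H_2^H)$, which
proves the converse. ■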

IV. Discussion 

Since the capacity region is convex, we can completely 
characterize Cbdbc by its boundary which corresponds to the 
weighted rate sum optimal rate pairs. Therefore, we introduce 
a weight vector $q = [q_1, q_2] \in \mathbb{R}_+^2 \setminus \{0\}$ and express the
weighted rate sum for given $q$ as $R_\Sigma(Q) = q_1 R_1(Q) + q_2 R_2(Q)$
with $R_1(Q) := \log\det(I_{N_1} + \frac{1}{\sigma^2} H_1 Q H_1^H)$ and
$R_2(Q) := \log\det(I_{N_2} + \frac{1}{\sigma^2} H_2 Q H_2^H)$. The aim is now to
find the optimal covariance matrix $Q^*(q)$ with $\mathrm{tr}(Q^*(q)) \le P$
satisfying the power constraint which maximizes the weighted
rate sum. This can be expressed as the following optimization
problem

$$\max_{Q}\ q_1 R_1(Q) + q_2 R_2(Q) \quad \text{s.t.} \quad \mathrm{tr}(Q) \le P,\ Q \succeq 0.$$ (5)

The Lagrangian for this optimization problem is given by

$$L(Q, \mu, \Psi) = -q_1 R_1(Q) - q_2 R_2(Q) - \mu (P - \mathrm{tr}(Q)) - \mathrm{tr}(Q \Psi).$$

Therewith, the covariance matrix maximizing (5) for given $q$ is
uniquely characterized by the Karush-Kuhn-Tucker conditions

$$-\mu I_{N_R} + \Psi = -q_1 H_1^H (\sigma^2 I_{N_1} + H_1 Q H_1^H)^{-1} H_1 - q_2 H_2^H (\sigma^2 I_{N_2} + H_2 Q H_2^H)^{-1} H_2,$$ (6a)

$$Q \succeq 0, \quad P \ge \mathrm{tr}(Q),$$ (6b)

$$\Psi \succeq 0, \quad \mu \ge 0,$$ (6c)

$$\mathrm{tr}(Q \Psi) = 0, \quad \mu (P - \mathrm{tr}(Q)) = 0,$$ (6d)

with primal, dual, and complementary slackness conditions
(6b), (6c), and (6d) respectively.
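Problem (5) is convex, so a generic first-order method can approximate its solution. The sketch below uses projected gradient ascent, which is a technique chosen here for illustration and not necessarily the method used by the authors; it assumes NumPy, natural logarithms, a fixed step size, and hypothetical channels and weights.

```python
import numpy as np

def rate(H, Q, sigma2):
    """log det(I + H Q H^H / sigma^2) with natural logarithm, matching the gradient below."""
    M = np.eye(H.shape[0]) + (H @ Q @ H.conj().T) / sigma2
    return np.linalg.slogdet(M)[1]

def gradient(H, Q, sigma2):
    """Gradient of the rate w.r.t. Q: H^H (sigma^2 I + H Q H^H)^{-1} H, cf. (6a)."""
    K = sigma2 * np.eye(H.shape[0]) + H @ Q @ H.conj().T
    return H.conj().T @ np.linalg.solve(K, H)

def project(Q, P):
    """Euclidean projection onto {Q >= 0, tr(Q) <= P} via the eigenvalues of Q."""
    lam, U = np.linalg.eigh((Q + Q.conj().T) / 2)
    lam = np.maximum(lam, 0.0)
    if lam.sum() > P:                     # project eigenvalues onto {lam >= 0, sum(lam) = P}
        s = np.sort(lam)[::-1]
        css = np.cumsum(s) - P
        k = np.nonzero(s - css / np.arange(1, len(s) + 1) > 0)[0][-1]
        lam = np.maximum(lam - css[k] / (k + 1), 0.0)
    return (U * lam) @ U.conj().T

def weighted_rate_sum(H1, H2, q, P, sigma2, steps=500, step_size=0.5):
    """Projected gradient ascent on q1 R1(Q) + q2 R2(Q); returns the best iterate found."""
    N_R = H1.shape[1]
    objective = lambda Q: q[0] * rate(H1, Q, sigma2) + q[1] * rate(H2, Q, sigma2)
    Q = (P / N_R) * np.eye(N_R, dtype=complex)          # start from equal power allocation
    best_Q, best_val = Q, objective(Q)
    for _ in range(steps):
        G = q[0] * gradient(H1, Q, sigma2) + q[1] * gradient(H2, Q, sigma2)
        Q = project(Q + step_size * G, P)
        val = objective(Q)
        if val > best_val:
            best_Q, best_val = Q, val
    return best_Q, best_val

# Hypothetical channels, weights, and power budget for illustration only.
H1 = np.array([[1.0, 0.5], [0.2, 1.0]], dtype=complex)
H2 = np.array([[0.8, -0.3], [0.4, 0.9]], dtype=complex)
P, sigma2 = 10.0, 1.0
Q_opt, val = weighted_rate_sum(H1, H2, (1.0, 2.0), P, sigma2)
uniform_val = 1.0 * rate(H1, (P / 2) * np.eye(2), sigma2) + 2.0 * rate(H2, (P / 2) * np.eye(2), sigma2)
```

Sweeping the weight vector `q` over the nonnegative quadrant and recording `(rate(H1, Q_opt, sigma2), rate(H2, Q_opt, sigma2))` traces out an approximation of the curved boundary; a dedicated convex solver would be the more robust choice in practice.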

The weighted rate sum optimal rate pairs describe the
curved section of the boundary and are uniquely characterized
by (6a)-(6d). The two endpoints correspond to the cases where
$q$ is chosen to optimize one unidirectional rate. More precisely,
the case $q_1 > 0$, $q_2 = 0$ means we want to maximize the
rate $R_1$, for which we can achieve the single-user capacity
$R_1^{(1)} := \max_Q R_1(Q) = \log\det(I_{N_1} + \frac{1}{\sigma^2} H_1 Q^{(1)} H_1^H)$, where
the superscript $(1)$ indicates that the weight vector $q$ is chosen
to optimize the rate $R_1$. For $R_2$ this leads to an achievable
rate $R_2^{(1)} = \log\det(I_{N_2} + \frac{1}{\sigma^2} H_2 Q^{(1)} H_2^H)$. Similarly, for
$q_1 = 0$, $q_2 > 0$ we can achieve $R_2^{(2)} = \log\det(I_{N_2} + \frac{1}{\sigma^2} H_2 Q^{(2)} H_2^H)$
and $R_1^{(2)} = \log\det(I_{N_1} + \frac{1}{\sigma^2} H_1 Q^{(2)} H_1^H)$.

Fig. 2. Achievable rate regions for $N_1 = N_2 = N_R = 2$.
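For the endpoints, the maximization over $Q$ is a single-user MIMO problem, for which the standard waterfilling solution over the eigenmodes of the channel applies. The paper does not spell this out, so the sketch below is an illustration under that assumption, using NumPy, base-2 logarithms, a nonzero channel matrix, and hypothetical channel values.

```python
import numpy as np

def rate(H, Q, sigma2):
    """log2 det(I + H Q H^H / sigma^2), in bits per channel use."""
    M = np.eye(H.shape[0]) + (H @ Q @ H.conj().T) / sigma2
    return np.linalg.slogdet(M)[1] / np.log(2.0)

def waterfilling(H, P, sigma2):
    """Covariance maximizing log det(I + H Q H^H / sigma^2) subject to tr(Q) <= P."""
    _, s, Vh = np.linalg.svd(H, full_matrices=False)
    keep = s > 1e-12                      # discard numerically zero eigenmodes
    s, Vh = s[keep], Vh[keep]
    inv_gain = sigma2 / s**2              # ascending, since singular values are descending
    for m in range(len(s), 0, -1):        # try the m strongest eigenmodes
        level = (P + inv_gain[:m].sum()) / m
        if level > inv_gain[m - 1]:       # weakest active mode still gets positive power
            break
    p = np.zeros(len(s))
    p[:m] = level - inv_gain[:m]          # power per eigenmode, sums to P
    V = Vh.conj().T
    return (V * p) @ V.conj().T

# Hypothetical channels and power budget for illustration only.
H1 = np.array([[1.0, 0.5], [0.2, 1.0]], dtype=complex)
H2 = np.array([[0.8, -0.3], [0.4, 0.9]], dtype=complex)
P, sigma2 = 10.0, 1.0

Q1 = waterfilling(H1, P, sigma2)          # plays the role of Q^{(1)}
R1_1 = rate(H1, Q1, sigma2)               # single-user capacity R_1^{(1)}
R2_1 = rate(H2, Q1, sigma2)               # rate R_2^{(1)} achieved at node 2 with the same Q
```

Swapping the roles of `H1` and `H2` gives the other endpoint, so the two calls to `waterfilling` yield the rate pairs $[R_1^{(1)}, R_2^{(1)}]$ and $[R_1^{(2)}, R_2^{(2)}]$.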

Figure 2 depicts an example of the capacity region in comparison
to the achievable rate regions of the superposition coding
[11] and the XOR coding approach [12], where the optimal
rate pair is given by

$$R_1^* = R_2^* := \max_{Q} \min\{R_1(Q), R_2(Q)\}.$$

The optimal unidirectional rate pairs correspond to the
points A and B in Figure 2. Point C in Figure 2 describes the
max-min optimal rate pair, which is the only rate pair where
the XOR coding approach achieves the capacity.

V. Conclusion 

In this work, we extended the bidirectional broadcast phase 
for the discrete memoryless channel with finite alphabets of [5] 
to the Gaussian MIMO case and derived the capacity region. 
We showed that there is not a unique rate sum optimal transmit 
strategy. Similar to the Gaussian MIMO MAC the weighted 
rate sum optimal transmit strategy for the bidirectional broadcast
phase depends now on the weights of the two rates.